■9 f 



PAGE: 1 RAW SEQUENCE LISTING DATE: 10/21/1999 

PATENT APPLICATION US/09/397,558 TIME: 09:32:35 

INPUT SET: S33714.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



ENTERED 



1 SEQUENCE LISTING 

2 

3 (1) General Information: 
4 
5 

6 (i) APPLICANT: Lai, Preeti 

7 Guegler, Karl J. 

8 Corley, Neil C. 
9 

10 (ii) TITLE OF INVENTION: PROSTATE GROWTH -ASSOCIATED MEMBRANE PROTEINS 

11 

12 

13 (iii) NUMBER OF SEQUENCES: 7 

14 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

17 (B) STREET: 3174 PORTER DRIVE 

18 (C) CITY: PALO ALTO 

19 (D) STATE: CALIFORNIA 

20 (E) COUNTRY: USA 

21 (F) ZIP: 94304 
22 

23 

24 (v) COMPUTER READABLE FORM: 

25 (A) MEDIUM TYPE: Floppy disk 

2 6 (B) COMPUTER: IBM PC compatible 

27 (C) OPERATING SYSTEM: PC-DOS/MS -DOS 

28 (D) SOFTWARE: Word Perfect 6.1 for Windows /MS -DOS 6.2 
29 

30 

31 (vi) CURRENT APPLICATION DATA: 

32 (A) APPLICATION NUMBER: 09/397,558 

33 (B) FILING DATE: 

34 (C) CLASSIFICATION: 
35 

36 (vii) PRIOR APPLICATION DATA: 

37 (A) APPLICATION NUMBER: 09/083,521 

38 (B) FILING DATE: 
39 

40 (viii) ATTORNEY/AGENT INFORMATION: 

41 (A) NAME: CERRONE, MICHAEL C. 

42 (B) REGISTRATION NUMBER: 3 9,132 

43 (C) REFERENCE/DOCKET NUMBER: PF-0527 US 
44 

45 

46 (ix) TELECOMMUNICATION INFORMATION: 
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RAW SEQUENCE LISTING DATE: 10/21/1999 

PATENT APPLICATION US/09/397,558 TIME: 09:32:35 

INPUT SET: S33714.raw 

(A) TELEPHONE: (650) 855-0555 

(B) TELEFAX: (650) 845-4166 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: PROSTUT10 

(B) CLONE: 1691243 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



Met 


Val 


His 


Val 


Ala 
5 


Tyr 


Ser 


Leu 


Cys 


Leu 
10 


Pro 


Met 


Arg 


Arg 


Ser 
15 


Glu 


Arg 


Tyr 


Leu 


Phe 
20 


Leu 


Asn 


Met 


Ala 


Tyr 
25 


Gin 


Gin 


Val 


His 


Ala 
30 


Asn 


He 


Glu 


Asn 


Ser 


Trp 


Asn 


Glu 


Glu 


Glu 


Val 


Trp Arg 


He 


Glu 










35 










40 










45 


Met 


Tyr 


He 


Ser 


Phe 
50 


Gly 


He 


Met 


Ser 


Leu 
55 


Gly 


Leu 


Leu 


Ser 


Leu 
60 


Leu 


Ala 


Val 


Thr 


Ser 
65 


He 


Pro 


Ser 


Val 


Ser 
70 


Asn 


Ala 


Leu 


Asn 


Trp 
75 


Arg 


Glu 


Phe 


Ser 


Phe 
80 


He 


Gin 


Ser 


Thr 


Leu 
85 


Gly 


Tyr 


Val 


Ala 


Leu 
90 


Leu 


He 


Ser 


Thr 


Phe 
95 


His 


Val 


Leu 


He 


Tyr 
100 


Gly 


Trp 


Lys 


Arg 


Ala 
105 


Phe 


Glu 


Glu 


Glu 


Tyr 
110 


Tyr 


Arg 


Phe 


Tyr 


Thr 
115 


Pro 


Pro 


Asn 


Phe 


Val 
120 


Leu 


Ala 


Leu 


Val 


Leu 
125 


Pro 


Ser 


He 


Val 


He 
130 


Leu 


Asp 


Leu 


Leu 


Gin 
135 


Leu 


Cys 


Arg 


Tyr 


Pro 
140 


Asp 





















(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRSTTUT03 
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PATENT APPLICATION US/09/397,558 TIME: 09:32:36 
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100 






(B) CLONE 


: 1999442 














i oi 






























102 
1 f)^ 




(xi) SEQUENCE DESCRIPTION: 


SEQ 


ID NO: 2 : 






104 


Met 


Phe 


Leu 


Pro 


Pro 


Val 


Val 


Leu 


Ala 


He 


Arg 


Ser 


Arg Tyr 


Val 


1 0*5 

-L vJ 










5 










10 








i r 


106 


Leu 


Glu 


Ala 


Ala 


Val 


Tyr 


Thr 


Phe 


Thr 


Met 


Phe 


Phe 


Ser Thr 


Phe 


X \J 1 










20 










25 








1 0 

j \j 


108 


Tvr 

xyx 


His 


Ala 


Cys 


Asp 


Gin 


Pro 


Gly 


He 


Val 


Val 


Phe 


Cys lie 


Met 


109 










35 










40 








45 


110 


Asp 


Tyr 


Asp 


Val 


Leu 


Gin 


Phe 


Cys 


Asp 


Phe 


Leu 


Gly 


Ser Leu 


Met 


1 1 1 

XXX 










50 










55 










112 


Sex* 


Val 


Trp 


Val 


Thr 


Val 


He 


Ala 


Met 


Ala 


Arg 


Leu 


Gin Pro 


Val 


111 

X X o 










65 










70 








7R 


114 


Val 


Lys 


Gin 


Val 


Leu 


Tyr 


Leu 


Leu 


Glv 

uxy 


Ala 


Met 


Leu 


Leu Ser 


Met 


1 1 c 

X X 3 










80 










85 










116 


Ala 


Leu 


Gin 


Leu 


Asp 


Arg 


His 


Gly 


Leu 


Trp 


Asn 


Leu 




Pro 


XX/ 










95 










100 








X \J 


118 


Ser 


Leu 


Phe 


Ala 


Leu 


Gly 


He 


Leu 


Ala 


Thr 


Ala 


Trp 


Thr Val 


Arcr 


1 1 Q 
X X -7 










110 










115 








ion 

X^ \J 


120 


Ser 


Val 


Arg 


Arg 


Arg 


His 


Cys 


Tyr 


Pro 


Pro 


Thr 


Trp 


Arg Arg 


Trn 
x xp 


X ^ X 










125 










130 








u j 


122 


Leu 


Phe 


Tyr 


Leu 


Cys 


Pro 


Gly 


Ser 


Leu 


He 


Ala 


Gly 


Ser Ala 


Val 


1 O'X 










140 










145 








i Rn 

X -3 \J 


124 


Leu 


Leu 


Tyr 


Ala 


Phe 


Val 


Glu 


Thr 


Arg 


Asp 


Asn 


Tyr 


Phe Tyr 


He 


1 










155 










160 








1 fiR 


126 


His 


Ser 


He 


Trp 


His 


Met 


Leu 


He 


Ala 


Gly 


Ser 


Val 


Gly Phe 


Leu 


127 










170 










175 










128 


Leu 


Pro 


Pro 


Arg 


Ala 


Lys 


Thr 


Asp 


His 


Gly 


Val 


Pro 


Ser Gly 


Ala 


129 










185 










190 








1 

X J 


130 


Arg 


Ala 


Arg 


Gly 


Cys 


Gly 


Tyr 


Gin 


Leu 


Cys 


He 


Asn 


Glu Gin 


Glu 


13 1 










200 










205 








oi n 


132 


Glu 


Pro 


Gly 


Pro 


Arg 


Gly 


Pro 


Arg 




Gly 


His 


Cys 




His 


i n 

X J J 










215 










220 










134 


Leu 


Cys 


Gin 


Leu 


Arg 


Gly 


Ala 


Leu 


Gly 


Leu 


Ala 


Leu 


Arg Gly 


xyx 


IOC 
1 J J 










230 










235 










X J o 


Glu 


Cys 


Phe 


Leu 


Glu 


Phe 


Phe 


Leu 


\jx y 


Val 


Trp 


Ser 


riu XJCLL 


7\ VCX 

y 


117 










245 










250 








£JJ 


138 


Arg Arg 


Gin 


Ala 


Val 


Phe 


Leu 


Glu 


Asp 


Met 


Glu 


Ser 




«xy 


1 "IQ 










260 










265 










140 


Thr 


Gin 


Asn 


Ser 


Ser 


Arg 


Asp 


Leu 


Glu 


Pro 


Phe 


Pro 


Gly His 


Gly 


141 

X*± X 










275 










280 










142 


Glu 


Leu 


Pro 


Glu 


Gly 


Leu 


Glu 


Ser 


Pro 


Cys 


He 


Met 


Glu Ser 


Phe 


143 










290 










295 








300 


144 


Leu 


Arg 


Thr 


Gly 


Ala 


Tyr 


Ala 


Gly 


Thr 


Glu 


Ser 


Leu 


Arg Thr 


Lys 


145 










305 










310 








315 


146 


Glu 


Ser 


Leu 


Leu 


Gin 


Val 


Trp 


Ser 


Leu 


Ser 


Trp 


Asp 


Ala Glu 


Pro 


147 










320 










325 








330 


148 


Ser 


Gin 


Asp 


Met 


Asp 


Ser 


Phe 


Pro 


Gly 


Arg 


Gin 


Ser 


Pro Val 


Arg 


149 










335 










340 








345 


150 


Ser 


Thr 


Ala 


Ser 


Phe 


Gin 


Arg 


Arg 


Trp 


Ser 


Leu 


Ser 


Trp Gly 


Asn 


151 










350 










355 








360 


152 


Gin 


He 


Ser 


Arg 


Phe 


Ser 


Gin 


Arg 


Leu 


Ser 


Asn 


Ser 


Gly Leu 


Arg 
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RAW SEQUENCE LISTING DATE: 10/21/1999 

PATENT APPLICATION US/09/397,558 TIME: 09:32:36 

INPUT SET: S33714.raw 

365 370 375 

Leu Pro Ser Gin Arg Gin Arg Leu Gly Cys Ala Val Leu Trp Arg 

380 385 390 

Arg Asp Cys Arg Met Asp Gly Ala Gly Thr Gly Ala Val Trp Val 

395 400 405 

Ala Gly lie Leu Val 

410 



(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: PROSTUT10 

(B) CLONE: 1691243 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 : 



CAAGTATAGG AGATTTCCAC CTTGGTTGGA 
ATTACTAAGT TTTTTCTTCG CTATGGTCCA 
AAGGTCAGAG AGATATTTGT TTCTCAACAT 
AAACTCTTGG AATGAGGAAG AAGTTTGGAG 
GAGCCTTGGC TTACTTTCCC TCCTGGCAGT 
AAACTGGAGA GAATTCAGTT TTATTCAGTC 
TACTTTCCAT GTTTTAATTT ATGGATGGAA 
TTATACACCA CCAAACTTTG TTCTTGCTCT 
TTTGCAGCTT TGCAGATACC CAGACTGAGC 
TACTTCTTTA AAAGCGGCTG CCCATTACAT 
TGTGACTGAG TGTTGGCCAG TGAGATGAAG 
TTTCATCCCT TCATCTTGCT GCTGGGATTG 
CCAGAGGATC AAAGCCACAC CCAAAGAGTA 
TACTTCCCTA CTTCCACTGC TTTTTCCTGC 
CATGAAGTGA AAATTAATTC TTTCTGCCCT 
TGTCTGAATT AACTAGACTG CAATAATTCT 
CAATTCACAT TAAAATTGAT TTTCCATTGT 
TCTTTCATTA GATATTTTGT ATCTGCTTGG 
TGGTAATTAC TAAAACTCTG TAATCTCCAA 
TCTATCATTC TCATAGATCT GCCTTATAAA 
TAAAAAAAAA AAA 



AACCTGGTTA CAGTGTAGAA AACAGCTTGG 60 

TGTTGCCTAC AGCCTCTGCT TACCGATGAG 120 

GGCTTATCAG CAGGTTCATG CAAATATTGA 180 

AATTGAAATG TATATCTCCT TTGGCATAAT 240 

CACTTCTATC CCTTCAGTGA GCAATGCTTT 3 00 

TACACTTGGA TATGTCGCTC TGCTCATAAG 360 

ACGAGCTTTT GAGGAAGAGT ACTACAGATT 420 

TGTTTTGCCC TCAATTGTAA TTCTGGATCT 480 

TGGAACTGGA ATTTGTCTTC CTATTGACTC 540 

TCCTCAGCTG TCCTTGCAGT TAGGTGTACA 600 

TCTCCTCAAA GGAAGGCAGC ATGTGTCCTT 660 

TGGATATAAC AGGAGCCCTG GCAGCTGTCT 720 

AGGCAGATTA GAGACCAGAA AGACCTTGAC 780 

ATTTAAGCCA TTGTAAATCT GGGTGTGTTA 840 

TCAGTTCTTT ATCCTGATAC CATTTAACAC 900 

TTCTTTTGAA AGCTTTTAAA GGATAATGTG 960 

CAATTAGTTA TACTCATTTT CCTGCCTTGA 102 0 

AATATATTAT CTTCTTTTTA ACTGTGTAAT 1080 

AATATTGCTA TCAAATTACA CACCATGTTT 1140 

CATTTAAATA AAAAGTACTA TTTAATGATT 1200 

1213 



(2) INFORMATION FOR SEQ ID NO : 4: 

< 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 base pairs 

(B) TYPE: nucleic acid 
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RAW SEQUENCE LISTING DATE: 10/21/1999 

PATENT APPLICATION US/09/397,558 TIME: 09:32:36 

INPUT SET: S33714.raw 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY : BRSTTUT03 

(B) CLONE: 1999442 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 : 



CGGACGCGTG GGCTGCTCTG CCTGAGCAAC 
ATTCGGAGTC GATATGTGCT GGAAGCTGCA 
TTCTATCATG CCTGTGACCA GCCAGGCATC 
CTGCAGTTCT GTGATTTCCT GGGCTCCTTA 
GCTCGTTTAC AGCCCGTGGT CAAGCAGGTG 
ATGGCTCTGC AGCTTGACCG ACATGGACTC 
CTGGGGATCT TGGCCACAGC CTGGACAGTA 
CCCACGTGGC GCCGCTGGCT TTTCTACTTG 
GTCCTGCTTT ATGCTTTTGT GGAGACCCGG 
CATATGCTCA TTGCGGGCAG TGTGGGCTTC 
GGGGTCCCAT CTGGAGCCCG GGCCCGGGGC 
GAGGAGCCTG GGCCTCGTGG GCCCAGGAGG 
AGAGGGGCTT TGGGCCTGGC CCTGAGGGGA 
GTGTGGAGCC CTCTTAGAAG GAGACAGGCT 
AGGACACAAA ACTCTTCCAG GGACCTGGAG 
GGCCTGGAGT CCCCCTGCAT CATGGAGTCC 
GAGTCCCTCA GGACCAAGGA GTCCCTCCTG 
CCTTCCCAAG ACATGGATTC CTTCCCAGGG 
TTCCAGAGGA GGTGGAGTCT ATCTTGGGGA 
AGCAACTCTG GCCTCAGGCT TCCTTCCCAG 
AGGAGGGATT GCAGGATGGA TGGAGCTGGG 
GTTTGATACA GGTGGAGTCT CTGTGTCTCC 



CTCATGTTTC TGCCACCTGT GGTCCTGGCC 60 

GTCTACACCT TCACCATGTT CTTCTCCACG 120 

GTGGTTTTCT GCATCATGGA CTACGATGTG 180 

ATGTCCGTGT GGGTCACTGT CATTGCCATG 240 

CTGTATTTGC TGGGAGCTAT GCTGCTGTCC 3 00 

TGGAACCTGC TTGGACCCAG TCTCTTCGCC 360 

CGCAGCGTCC GCCGCCGGCA CTGCTACCCA 420 

TGCCCTGGCA GCCTTATTGC AGGCAGTGCC 480 

GACAACTACT TCTACATTCA CAGCATTTGG 540 

CTGCTGCCCC CTCGTGCCAA GACTGACCAC 600 

TGTGGTTACC AGCTATGCAT CAACGAGCAG 660 

GGCCACTGTC AGCAGCATCT GTGCCAGCTG 720 

TATGAATGCT TCCTAGAGTT CTTTCTGGGG 780 

GTATTTCTTG AGGACATGGA GTCTTTCTCA 840 

CCCTTCCCAG GACATGGAGA ACTTCCTGAG 900 

TTCTTAAGGA CTGGAGCCTA TGCAGGCACA 960 

CAGGTGTGGA GCCTTTCCTG GGATGCAGAG 1020 

AGACAAAGCC CTGTCAGGAG CACAGCATCT 1080 

AACCAAATTT CCAGATTTTC CCAGAGGCTC 1140 

AGGCAGCGTC TGGGCTGTGC TGTGCTGTGG 12 00 

ACTGGGGCTG TCTGGGTGGC TGGTATCCTC 1260 

ATAGAAG 12 97 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 1216498 





(xi) SEQUENCE DESCRIPTION: 


SEQ 


ID NO: 5 : 






Met 


Gly Arg Ala Met Val Val Arg 


Leu 


Gly Leu Gly Leu 


Leu 


Leu 




5 




10 




15 


Leu 


Ala Leu Leu Leu Pro Thr Gin 


He 


Tyr Cys Asn Gin 


Thr 


Ser 




20 




25 




30 


Val 


Ala Pro Phe Ser Gly Asn Gin 


Ser 


He Ser Ala Ala 


Pro 


Asn 




35 




40 




45 



SEQUENCE VERIFICATION REPORT 
PATENT APPLICATION US/09/397,558 



DATE: 10/21/1999 
TIME: 09:32:37 



INPUT SET: S33714.raw 



Original Text 



SEQUENCE MISSING ITEM REPORT 
PATENT APPLICATION US/09/397,558 



DATE: 10/21/1999 
TIME: 09:32:37 



INPUT SET: S33714.raw 



< < THERE ARE NO ITEMS MISSING > > 



PAGE: 1 



Line 



SEQUENCE CORRECTION REPORT DATE: 10/21/1999 

PATENT APPLICATION US/09/397,558 TIME: 09:32:37 

INPUT SET: S33714.raw 



Original Text Corrected Text 



